Speech Data Clustering Based on Phoneme Error Trend for Unsupervised Acoustic Model Adaptation

Authors

Taichi Asami

Satoshi Kobashikawa

Hirokazu Masataki

Osamu Yoshioka

Satoshi Takahashi

Abstract

Unsupervised cluster adaptive training of acoustic models offers promise in improving recognition accuracy, especially for speech recognition systems that store massive sets of speech samples from unknown people. How to classify the variety of acoustic characteristics is an important problem in adaptation sample clustering. We propose a novel speech sample clustering method that focuses on the phoneme error trend in each speech sample. The proposed method classifies adaptation samples in terms of the trend of phoneme discrimination in each sample, and represents each sample as a compact phoneme error trend vector whose dimension is at most the number of phonemes. Experiments illustrate that the phoneme error trend vectors have enough expressiveness to classify acoustic characteristics effectively, and are compact enough to provide robustness against unknown data.
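As a rough illustration of the idea in the abstract, the sketch below builds one error-rate entry per phoneme for each speech sample and groups the resulting vectors. It is a minimal sketch under stated assumptions, not the authors' method: the abstract does not say how phoneme errors are estimated without transcripts or which clustering algorithm is used. The per-phoneme error flags, the toy phoneme inventory, the function names, and the choice of plain k-means are all hypothetical.

```python
import math

# Hypothetical toy phoneme inventory; a real system would use its full set.
PHONEMES = ["a", "i", "u", "k", "s"]


def error_trend_vector(observations, phonemes=PHONEMES):
    """Return per-phoneme error rates for one speech sample.

    observations: iterable of (phoneme, is_error) pairs. How is_error is
    estimated in an unsupervised setting is an assumption left open here.
    """
    totals = {p: 0 for p in phonemes}
    errors = {p: 0 for p in phonemes}
    for phoneme, is_error in observations:
        totals[phoneme] += 1
        errors[phoneme] += int(is_error)
    # Phonemes that never occur in the sample get a rate of 0.0.
    return [errors[p] / totals[p] if totals[p] else 0.0 for p in phonemes]


def kmeans(vectors, k, iterations=20):
    """Plain k-means; centroids start at the first k vectors (a simplification)."""
    centroids = [list(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assign each vector to its nearest centroid (Euclidean distance).
        labels = [
            min(range(k), key=lambda c: math.dist(v, centroids[c]))
            for v in vectors
        ]
        # Move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Made-up samples: two with vowel errors, two with consonant errors.
samples = [
    [("a", True), ("a", True), ("k", False), ("s", False)],
    [("a", False), ("i", False), ("k", True), ("s", True)],
    [("a", True), ("i", True), ("k", False), ("s", False)],
    [("a", False), ("k", True), ("k", True), ("s", False)],
]
vectors = [error_trend_vector(s) for s in samples]
labels, centroids = kmeans(vectors, k=2)
print(labels)
```

Each vector has exactly one entry per phoneme, which matches the abstract's statement that the dimension is at most the number of phonemes. In this toy data the vowel-error samples and the consonant-error samples end up in separate clusters.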

Related resources

Unsupervised acoustic model adaptation based on phoneme error minimization

In this paper, a new decoding method for unsupervised acoustic model adaptation is presented. In unsupervised adaptation framework, the effectiveness of adaptation process is greatly affected by the mis-recognized labels. Therefore, selection of the adaptation data guided by the confidence measures is effective in unsupervised adaptation. We propose phoneme error minimization framework for exac...

Rapid unsupervised adaptation using frame independent output probabilities of gender and context independent phoneme models

Business is demanding higher recognition accuracy with no increase in computation time compared to previously adopted baseline speech recognition systems. Accuracy can be improved by adding a gender dependent acoustic model and unsupervised adaptation based on CMLLR (Constrained Maximum Likelihood Linear Regression). CMLLR-based batch-type unsupervised adaptation estimates a single global trans...

Unsupervised adaptation for acoustic language identification

Our system for automatic language identification (LID) of spoken utterances is performed with language dependent parallel phoneme recognition (PPR) using Hidden Markov Model (HMM) phoneme recognizers and optional phoneme language models (LMs). Such a LID system for continuous speech requires many hours of orthographically transcribed data for training of language dependent HMMs and LMs as well ...

Unsupervised Acoustic Model Adaptati Minimizatio

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

Publication date: 2012
